feat(evaluators): add new lluna client by namrataghadi-galileo · Pull Request #213 · agentcontrol/agent-control

namrataghadi-galileo · 2026-05-06T19:49:12Z

Summary

What changed and why.

Scope

User-facing/API changes:
Internal changes:
Out of scope:

Risk and Rollout

Risk level: low / medium / high
Rollback plan:

Testing

Added or updated automated tests
Ran make check (or explained why not)
Manually verified behavior

Checklist

Linked issue/spec (if applicable)
Updated docs/examples for user-facing changes
Included any required follow-up tasks

codecov · 2026-05-06T19:53:12Z

Codecov Report

❌ Patch coverage is 94.38202% with 25 lines in your changes missing coverage. Please review.

Files with missing lines	Patch %	Lines
engine/src/agent_control_engine/core.py	75.43%	14 Missing ⚠️
.../agent_control_evaluator_galileo/luna/evaluator.py	97.08%	4 Missing ⚠️
...src/agent_control_evaluator_galileo/luna/config.py	91.89%	3 Missing ⚠️
...src/agent_control_evaluator_galileo/luna/client.py	98.89%	2 Missing ⚠️
...ks/python/src/agent_control/evaluators/__init__.py	66.66%	2 Missing ⚠️

📢 Thoughts on this report? Let us know!

abhinav-galileo · 2026-05-20T18:28:27Z

+def luna_config() -> dict[str, Any]:
+    """Build the direct Luna evaluator config used by the composite control."""
+    config: dict[str, Any] = {
+        "scorer_label": LUNA_SCORER_LABEL,
+        "threshold": LUNA_THRESHOLD,
+        "operator": "gte",
+        "payload_field": "output",
+        "on_error": "allow",
+    }
+    if GALILEO_PROJECT_ID:
+        config["project_id"] = GALILEO_PROJECT_ID
+    return config


payload_field, on_error, and project_id are set at the top level of the evaluator config, but LunaEvaluatorConfig declares none of them and the shared BaseModel uses extra="ignore", so all three are silently dropped. The example demonstrates configuration that has no effect — users following it will believe these settings are honored. Either implement these as first-class fields on LunaEvaluatorConfig (looks intended for payload_field and on_error), or remove them from the example and README.

abhinav-galileo · 2026-05-20T18:28:33Z

            "message": self._truncated_message(result.message),
        }
        metadata = dict(result.metadata or {})
+        metadata["selected_data"] = data


This unconditionally injects the raw selector output (prompts, tool args, model output, potentially PII or secrets) into evaluator metadata for every leaf control. The metadata flows out through EvaluationResponse.matches/errors/non_matches to API clients, and is also fanned out into OTEL span attributes by the SDK's otel_sink (every metadata key becomes agent_control.metadata.<key>). The server's _sanitize_control_match only redacts internal evaluator error strings — it doesn't touch metadata. No size cap, redaction, or opt-out. Gate selected_data behind a config flag with truncation, or extend the sanitizer to scrub it before exporting.

abhinav-galileo · 2026-05-20T18:28:37Z

+dependencies = [
+    "agent-control-sdk",
+    "agent-control-evaluator-galileo",
+]


dependencies declares only agent-control-sdk and agent-control-evaluator-galileo, but [tool.uv.sources] lists six workspace packages. [tool.uv.sources] only overrides sources for already-declared dependencies — it does not add packages. The SDK imports agent_control_telemetry (and other workspace packages) at runtime, but those are not vendored in editable mode, so demo_agent.py's first import will fail when a user runs uv run python setup_controls.py as documented. See examples/google_adk_plugin/pyproject.toml for the correct pattern (all workspace packages listed under both dependencies and [tool.uv.sources]).

abhinav-galileo · 2026-05-20T18:28:40Z

+If the scorer requires explicit project resolution, set:
+
+```bash
+export GALILEO_PROJECT_ID="00000000-0000-0000-0000-000000000000"
+```


The README tells users to set GALILEO_PROJECT_ID for project-scoped scorer resolution, but LunaEvaluatorConfig has no project_id field and setup_controls.py places it at the top level where it gets dropped by extra="ignore". The instruction is misleading. Either implement the field end-to-end through the config and scorer payload, or remove this section.

abhinav-galileo · 2026-05-20T18:28:45Z

+            metadata={
+                "error": error_detail,
+                "error_type": type(error).__name__,
+                "scorer_label": self.config.scorer_label,
+                "scorer_id": self.config.scorer_id,
+                "scorer_version_id": self.config.scorer_version_id,
+            },


Storing error_detail in metadata["error"] lets raw upstream error text bypass the server sanitizer. _sanitize_control_match redacts result.error and metadata["condition_trace"]["error"], but not top-level metadata["error"], so HTTP error bodies, exception messages with URLs, and internal hints leak through to API responses and telemetry. error_type is already in metadata, which is the part callers can safely act on — error_detail is redundant. Drop it, or replace with a safe error code.

abhinav-galileo · 2026-05-20T18:28:49Z

+    def _get_client(self) -> GalileoLunaClient:
+        """Get or create the Galileo Luna client."""
+        if self._client is None:
+            self._client = GalileoLunaClient()
+        return self._client


Evaluator instances are LRU-cached and reused across calls; the base Evaluator docstring explicitly warns against storing mutable request-scoped state on self. _get_client mutates self._client with no synchronization. There is no asyncio race within _get_client itself (no await), but multi-thread or multi-event-loop reuse can construct two clients on the first concurrent call, orphan one of them, and leak its httpx.AsyncClient connection pool. Auth already validates in __init__ — construct the client eagerly there too.

abhinav-galileo · 2026-05-20T18:28:52Z

        metadata = dict(result.metadata or {})
+        metadata["selected_data"] = data
        metadata["condition_trace"] = trace


metadata = dict(result.metadata or {}) then metadata["selected_data"] = data unconditionally overwrites whatever the evaluator deliberately put under selected_data. Evaluators do return that key (see the mock in engine/tests/test_core.py:160), and an evaluator might want to ship a sanitized or transformed version. The contract is not currently clear about who owns the key — consider namespacing the engine-injected value (e.g. engine_selected_data) so it cannot collide with evaluator-supplied metadata.

abhinav-galileo · 2026-05-22T10:07:58Z

+        headers: dict[str, str] | None,
+    ) -> tuple[str, dict[str, str]]:
+        request_headers = dict(headers or {})
+        if self.api_secret is None:


The endpoint/auth mode is selected implicitly from whichever credential is present: if an API secret is set, the client uses the internal route; otherwise it uses the public API-key route. If both credentials are present, the secret path wins silently. Please make this an explicit runtime/client mode, validate that the matching credential is present, and log the selected mode without logging secrets.

abhinav-galileo · 2026-05-22T10:07:58Z

+
+        text = _coerce_payload_text(data)
+        scorer_label = self.config.scorer_label or ""
+        if "output" in scorer_label:


For scalar selected data, input vs output routing is inferred by checking whether the scorer label contains output. That is surprising behavior for a public evaluator contract, especially because the example config suggests there is an explicit payload_field knob. Please make the payload side explicit in config and test it, or document this heuristic very clearly.

abhinav-galileo · 2026-05-22T10:07:58Z

+    if isinstance(score, list):
+        return threshold in score
+    if isinstance(score, dict):
+        if isinstance(threshold, str) and threshold in score:


For dict scores, contains currently matches both keys and values. That can trigger unexpectedly when the threshold appears as a field name rather than as the scorer value. Please either narrow this to values-only or document and test the intended semantics explicitly.

…new-luna-client

abhinav-galileo · 2026-05-26T15:01:52Z

+        for preview_key in ("engine_selected_data_preview", "selected_data_preview"):
+            preview = metadata.get(preview_key)
+            if isinstance(preview, dict) and "value" in preview:
+                safe_metadata["input"] = preview["value"]


This is a silent regression in the privacy posture of event metadata. _safe_event_metadata deliberately strips the four *_data* keys to keep selected-data out of observability sinks, but the line below copies preview["value"] into metadata["input"], which is not in _DEBUG_METADATA_ATTRIBUTE_KEYS. The OTEL sink (otel_sink.py) re-exports every metadata key as agent_control.metadata.<key>, so the same preview that was just filtered out leaks back through under a new name.

The preview's own redaction is shallow:

Selector paths that resolve to a primitive (e.g. input.api_key) return a plain string. _truncate_string only truncates, it never redacts, so a 32-char secret lands verbatim on the span.

Dict redaction is key-name substring matching against a 7-entry list; nested sensitive strings (e.g. {"prompt": "my password is hunter2"}) pass through.

Suggestions:

Drop the synthesized input field entirely, or

Make it explicit opt-in via an env flag (mirroring AGENT_CONTROL_INCLUDE_RAW_SELECTED_DATA), or

Use a namespaced key like engine_input_preview and add it to the OTEL strip set.

We need to keep this behavior for now because the UI’s control-span Messages/Input view depends on the synthesized metadata["input"] field. The control span search result exposes input as a first-class display field, and in the current OTEL ingestion path this value is derived from the exported control event metadata. Removing the alias would preserve the stricter privacy posture, but it would also regress the Messages view by no longer showing the selected input text for control spans.

We can wait for the users to come back on this to us. All other spans show raw data in the Messages View in UI

abhinav-galileo · 2026-05-26T15:02:00Z

+def _has_text(value: str | None) -> bool:
+    return value is not None and value != ""


_has_text only rejects "", but the client's _has_value strips whitespace before testing. So " " bypasses the "No data to score with Luna" shortcut, reaches client.invoke, and raises ValueError("At least one of input or output must be provided."). The outer except records that as an evaluator error.

For deny controls, engine.process() flips is_safe=False when a deny control errors, so whitespace data fails closed, the opposite of the explicit no-data path. Align with the client's semantics:

def _has_text(value: str | None) -> bool: return value is not None and value.strip() != ""

The existing test_evaluator_does_not_call_api_for_empty_data only covers ""; add a " " case.

Changed _has_text to use value.strip() != "", matching the client’s _has_value semantics. I also parameterized the existing empty-data test in test_luna_evaluator.py (line 443) so both "" and " " take the no-data path and do not call the API.

abhinav-galileo · 2026-05-26T15:02:07Z

+    def _derive_api_url(self, console_url: str) -> str:
+        """Derive the API URL from a Galileo Console URL."""
+        url = console_url.rstrip("/")
+
+        if "console." in url:
+            return url.replace("console.", "api.")
+        if "console-" in url:
+            return url.replace("console-", "api-", 1)
+
+        if url.startswith("https://"):
+            return url.replace("https://", "https://api.")
+        if url.startswith("http://"):
+            return url.replace("http://", "http://api.")
+
+        return url


The two in checks operate on the entire URL, not on the hostname prefix. A few cases this gets wrong:

https://my-console.example.com becomes https://my-api.example.com instead of being left alone (or being something sensible like https://api.my-console.example.com).

https://app.galileo.ai/console.html rewrites the path: the console. in /console.html triggers the substitution and yields https://app.galileo.ai/api.html. The host isn't even touched.

The companion tests in tests/test_luna_coverage_gaps.py only exercise canonical Galileo hostnames (console.galileo.ai, console-staging.galileo.ai) and a couple of plain example.com hosts, so the bug surface is invisible to CI.

Fix:

from urllib.parse import urlsplit, urlunsplit parts = urlsplit(url) host = parts.hostname or "" if host.startswith("console."): new_host = "api." + host[len("console."):] elif host.startswith("console-"): new_host = "api-" + host[len("console-"):] else: ... # decide on fallback policy explicitly

Operate on the hostname only, and gate substitution on startswith.

Separate but related: the fallback at the bottom (https://example.com -> https://api.example.com) silently rewrites the host of any non-console URL. The plain-host tests codify it as intended, but it is surprising behavior for users supplying an explicit GALILEO_CONSOLE_URL to their own infrastructure. Worth confirming this is intentional and documenting it.

Implemented the URL derivation fix in client.py (line 255). It now parses with urlsplit, rewrites only hostname prefixes console. and console-, and preserves the existing fallback of prefixing non-console HTTP(S) hosts with api.. I added docstring text to make that fallback explicit.

abhinav-galileo · 2026-05-26T15:02:15Z

+    def _handle_error(
+        self,
+        error: Exception,
+    ) -> EvaluatorResult:
+        error_detail = str(error)
+        return EvaluatorResult(
+            matched=False,
+            confidence=0.0,
+            message=f"Luna evaluation error: {error_detail}",
+            metadata={
+                "error_type": type(error).__name__,
+                "scorer_label": self.config.scorer_label,
+                "scorer_id": self.config.scorer_id,
+                "scorer_version_id": self.config.scorer_version_id,
+            },
+            error=error_detail,
+        )


On any exception, _handle_error returns matched=False, confidence=0.0. luna2 already exposes on_error: 'allow' | 'deny' for safety-critical flows, so users mixing both evaluators get inconsistent failure semantics.

Partial mitigation today: for deny controls, engine.process() sets is_safe=False when a deny control errors (see engine/core.py around line 765), so the safety-critical path is already fail-closed. The gap is real for steer/observe actions and for parity with luna2.

Consider adding:

on_error: Literal["allow", "deny"] = "allow"

to LunaEvaluatorConfig and honoring it in _handle_error. Note that EvaluatorResult forbids matched=True when error is set, so the deny variant should still produce matched=False but with metadata the runner interprets as fail-closed (same contract as luna2).

I don’t think we should add on_error to Luna in this patch.

The current engine contract treats evaluator failures through EvaluatorResult.error: when error is set, matched must be False, and deny controls already fail closed at the engine level when a deny evaluation errors. For steer/observe, adding on_error="deny" only as metadata would not change behavior today because the engine does not interpret fallback_action metadata. To make it behaviorally effective, Luna would either need to suppress error and synthesize a matched result, which loses the evaluator-health signal, or we would need a shared engine/model contract for “error fallback action.”

add new lluna client

9f1791a

namrataghadi-galileo changed the title ~~feat(evaluator): add new lluna client~~ feat(evaluators): add new lluna client May 7, 2026

namrataghadi-galileo added 21 commits May 7, 2026 16:51

fix the url

8d2227d

Merge branch 'main' into feature/64546-add-new-luna-client

db02db8

feat(galileo): support internal scorer auth

0cce0bf

add auth and update schema

dd252be

fix(galileo): align luna scorer response schema

74fcbeb

update the schemas and corresponding tests

7b0a15d

update the schemas for scorer

523524d

update luna client schemas

34f430d

fix tests

ad0b2dc

remove unwanted fields

81cea04

remove project_id from evaluator config

f3cf8f7

Merge branch 'main' into feature/64546-add-new-luna-client

9cf463a

add evaluator context

025f96f

remove evaluation context

15e7a01

add tests for coverage

a06d3f1

coverage

c8662f7

move coervagera

9a94bba

failing test

f0d11b7

add input text that goes into controls evaluators

f683dda

add docstring

7210fc1

Merge branch 'main' into feature/64546-add-new-luna-client

81a5f1d

abhinav-galileo reviewed May 20, 2026

View reviewed changes

abhinav-galileo reviewed May 22, 2026

View reviewed changes

namrataghadi-galileo and others added 7 commits May 22, 2026 10:04

Merge branch 'main' into feature/64546-add-new-luna-client

2c1b488

feat(server): add control clone-and-bind endpoint

18fe79f

fix(server): bound control attachment expansion

58d9e14

address comments

f513dac

Merge branch 'feature/control-clone-and-bind' into feature/64546-add-…

c82de28

…new-luna-client

merging from main

7a237da

fix case where input does not show text

916dcde

abhinav-galileo reviewed May 26, 2026

View reviewed changes

namrataghadi-galileo added 2 commits May 26, 2026 09:19

address comments

8c522b8

linting

e5d16a4

		def _has_text(value: str \| None) -> bool:
		return value is not None and value != ""

Conversation

namrataghadi-galileo commented May 6, 2026

Summary

Scope

Risk and Rollout

Testing

Checklist

Uh oh!

codecov Bot commented May 6, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

codecov Bot commented May 6, 2026 •

edited

Loading